Efficient Yet Deep Convolutional Neural Networks for Semantic Segmentation
نویسندگان
چکیده
Semantic Segmentation using deep convolutional neural network pose more complex challenge for any GPU intensive work, as it has to compute million of parameters resulting to huge consumption of memory. Moreover, extracting finer features and conducting supervised training tends to increase the complexity furthermore. With the introduction of Fully Convolutional Neural Network, which uses finer strides and utilizes deconvolutional layers for upsampling, it has been a go to for any image segmentation task. We propose two segmentation architecture transferring weights from the popular classification neural net VGG19 and VGG16 which were trained on Imagenet classification dataset, transform all the fully connected layers to convolutional layers, use dilated convolution for decreasing the parameters, moreover we add more finer strides and attach four skip architectures which are concatenated with the deconvolutional layers in steps. We train on two stages, first with PASCAL VOC2012 training data and then with SBD training and validation set. With our model, FCN-2s-Dilated-VGG19 we yield better score for PASCAL VOC2012 test set with a meanIOU of 69 percent which is 1.8 percent better than FCN-8s. And with FCN-2s-Dilated-VGG16 we score a meanIOU of 67.6 percent. On the other hand our model consumes up to 10-20 percent less memory than FCN-8s for training with NVIDIA Pascal GPUs, making it more efficient and less memory consuming architecture for pixel-wise segmentation. Keywords— Deep Learning, Convolutional Neural Network, Semantic Image Segmentation, Skip Architectures;
منابع مشابه
A multi-scale convolutional neural network for automatic cloud and cloud shadow detection from Gaofen-1 images
The reconstruction of the information contaminated by cloud and cloud shadow is an important step in pre-processing of high-resolution satellite images. The cloud and cloud shadow automatic segmentation could be the first step in the process of reconstructing the information contaminated by cloud and cloud shadow. This stage is a remarkable challenge due to the relatively inefficient performanc...
متن کاملNeither Quick Nor Proper - Evaluation of QuickProp for Learning Deep Neural Networks
Neural networks and especially convolutional neural networks are of great interest in current computer vision research. However, many techniques, extensions, and modifications have been published in the past, which are not yet used by current approaches. In this paper, we study the application of a method called QuickProp for training of deep neural networks. In particular, we apply QuickProp d...
متن کاملIntegration of Deep Learning Algorithms and Bilateral Filters with the Purpose of Building Extraction from Mono Optical Aerial Imagery
The problem of extracting the building from mono optical aerial imagery with high spatial resolution is always considered as an important challenge to prepare the maps. The goal of the current research is to take advantage of the semantic segmentation of mono optical aerial imagery to extract the building which is realized based on the combination of deep convolutional neural networks (DCNN) an...
متن کاملDeep Neural Networks for Semantic Segmentation of Multispectral Remote Sensing Imagery
A semantic segmentation algorithm must assign a label to every pixel in an image. Recently, semantic segmentation of RGB imagery has advanced significantly due to deep learning. Because creating datasets for semantic segmentation is laborious, these datasets tend to be significantly smaller than object recognition datasets. This makes it difficult to directly train a deep neural network for sem...
متن کاملDeep learning and its application to medical image segmentation
One of the most common tasks in medical imaging is semantic segmentation. Achieving this segmentation automatically has been an active area of research, but the task has been proven very challenging due to the large variation of anatomy across different patients. However, recent advances in deep learning have made it possible to significantly improve the performance of image recognition and sem...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1707.08254 شماره
صفحات -
تاریخ انتشار 2017